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Abstract 

Background: Treated effluents from wastewater treatment works can comprise a large proportion of the flow of 
rivers in the developed world. Exposure to these effluents, or the steroidal estrogens they contain, feminizes wild 
male fish and can reduce their reproductive fitness. Long-term experimental exposures have resulted in skewed sex 
ratios, reproductive failures in breeding colonies, and population collapse. This suggests that environmental 
estrogens could threaten the sustainability of wild fish populations. 

Results: Here we tested this hypothesis by examining population genetic structures and effective population sizes 
(A/ e ) of wild roach {Rutilus rutilus L.) living in English rivers contaminated with estrogenic effluents. N e was estimated 
from DNA microsatellite genotypes using approximate Bayesian computation and sibling assignment methods. 
We found no significant negative correlation between N e and the predicted estrogen exposure at 28 sample sites. 
Furthermore, examination of the population genetic structure of roach in the region showed that some 
populations have been confined to stretches of river with a high proportion of estrogenic effluent for multiple 
generations and have survived, apparently without reliance on immigration offish from less polluted sites. 

Conclusions: These results demonstrate that roach populations living in some effluent-contaminated river stretches, 
where feminization is widespread, are self-sustaining. Although we found no evidence to suggest that exposure to 
estrogenic effluents is a significant driving factor in determining the size of roach breeding populations, a reduction 
in N e of up to 65% is still possible for the most contaminated sites because of the wide confidence intervals 
associated with the statistical model. 
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Background 

Approximately two-thirds of the worlds freshwater is 
used to dilute wastewater discharges. The demand for 
freshwater is expected to rise by 70% by 2050 [1] driving 
an urgent need to understand the impacts of treated 
waste effluent discharges on aquatic ecosystems. Waste- 
water treatment works (WWTW) effluents contain tens 
of thousands of chemicals, including pharmaceuticals 
and natural steroid estrogens that are biologically active 
at low (ng/L) exposure concentrations [2]. However, the 
long-term consequences of exposure to most of these 
chemicals on fish health and population sustainability 
are not known. 
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There is substantial evidence showing that experimen- 
tal exposure of fish to WWTW effluents and the estro- 
gens they contain can result in adverse health effects, 
including effects on reproductive development and breed- 
ing output. This has led to concerns that freshwater fish 
populations might also be affected with cascading conse- 
quences for freshwater ecosystems. Feminization of male 
fish is widespread in stretches of rivers downstream of 
WWTW outfalls as demonstrated in studies using wild 
[3,4] and caged [5,6] fish. Feminized phenotypes include 
the presence of vitellogenin, a female-specific protein in 
the blood of male fish [7] and the intersex condition: the 
presence of oocytes and/or female reproductive ducts in 
otherwise male gonads [3]. Feminization has been at- 
tributed to the presence of estrogens in effluents: estra- 
diol (E 2 ) and estrone (E^ from human excretion; 17 
alpha-ethinylestradiol (EE 2 ), a component of the female 
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contraceptive pill [8]; and a large number of other estro- 
genic chemicals from industrial and domestic effluents. 
WWTW effluents can also induce genotoxic effects [9], 
alterations in immune function [10], decreased reproduct- 
ive output [11], altered stress response [12] and changes 
in reproductive behavior [13]. 

Concern about estrogens in rivers in the United Kingdom 
drove a £40 M programme to evaluate the efficacy of vari- 
ous tertiary treatment processes in the removal of estro- 
gens [14,15]. Implementation of such processes will, 
however, incur considerable costs and a greater carbon 
footprint for WWTW [14,16], emphasising the need to 
understand better the population-level consequences for 
exposure to estrogenic and other so-called endocrine dis- 
rupting chemicals (EDCs). 

A critical question is whether chronic exposure to es- 
trogenic effluents negatively impacts the viability of wild 
fish populations, but this has been difficult to address ex- 
perimentally as it requires controlled experiments extend- 
ing over periods of several years. Limited studies suggest 
that high concentrations of EE 2 (between 3 to 6 ng/L) in 
the aquatic environment could be a threat to the sustain- 
ability of fish populations. For example, a controlled ex- 
posure of an entire lake to EE 2 in Canada resulted in the 
collapse of the fathead minnow (Pimephales promelas) 
population within three years [17]. Likewise, long-term 
(>204 days) laboratory exposures of a range of fish species 
have resulted in the absence of breeding males [18-20] 
and a three-year exposure of roach (Rutilus rutilus L.) to 
an undiluted WWTW effluent in large tanks resulted in 
an all-female population [21]. It is not known, however, if 
this occurs in rivers contaminated by effluents. Female fe- 
cundity can also be reduced through estrogen exposure, 
which can potentially reduce population growth rates [22] . 
Although the exposure concentrations in these studies 
were high compared to those typically experienced by wild 
fish populations [23], exposures to EE 2 at concentrations 
below 1 ng/L during the period of sexual development, 
have been shown to result in feminized gonads in roach 
[19] and decreased egg fertilization and female-skewed 
sex ratios in fathead minnows [24]. Evidence from wild 
roach living in UK rivers has similarly shown that femi- 
nized fish (generally less than 10% of males) with large 
numbers of eggs in their gonads have impaired semen 
quality [25] and severely (up to 76%) reduced reproductive 
success [26]. 

While these studies suggest that exposure to high con- 
centrations of effluent could threaten the viability of fish 
populations, aggregates of cyprinid fish, including roach, 
are often found in effluent contaminated rivers. How- 
ever, numbers alone may provide a misleading assess- 
ment of population sustainability as these could be sink 
populations maintained by substantial immigration from 
less contaminated locations where successful reproduction 



still occurs. Likewise, effective population sizes (N e ) - 
related to the breeding population of fish - may be de- 
creased without necessarily impacting on population sizes 
[27], as effluent exposure can affect the number of repro- 
ducing individuals and can skew reproductive success 
[21,26]. Density-dependent growth and survival can also 
play an important role [28], so a few reproducing indi- 
viduals can potentially maintain large adult population 
sizes. Indeed, studies in several species of marine fish 
with high fecundity have shown that N e can be several or- 
ders of magnitude smaller than census population sizes. 
One study found two populations of the exploited New 
Zealand snapper (Pagrus auratus) to have values of N e less 
than 1,000 despite adult census population sizes in the 
millions [29]. Similarly, a study of striped bass (Morone 
saxatilis), a freshwater fish species, found cohorts to con- 
sist of a few, full sib families, despite an adult census size 
of over 300,000 [30]. Critically, N e influences long-term 
sustainability as it determines the rate at which genetic 
diversity is lost from a population through genetic drift 
[31]. High genetic diversity increases the long-term poten- 
tial for populations to adapt to changes in the environ- 
ment and also acts to reduce the risk of inbreeding [32]. 
Small N e , however, may act to increase the chances of los- 
ing some lethal or sub-lethal mutations through genetic 
purging. 

Understanding the impact of estrogenic effluents on 
the sustainability of fish populations is, therefore, para- 
mount, but has been limited to date by the logistical 
challenges involved in undertaking long-term exposures 
to realistic effluent concentrations, and understanding 
the demographic history of wild fish populations at 
highly contaminated sites. In this study, we examine evi- 
dence for population impacts on wild roach (R. rutilus), 
a fish species in which feminization is widespread, in 
southern England. Southern England has some of the 
highest proportions of WWTW effluent in rivers known 
globally, and numerous weirs and locks which poten- 
tially confine fishes to heavily polluted stretches of river. 
We have used this system to evaluate whether stretches 
of river highly contaminated with estrogenic effluents 
have impaired breeding populations of roach. To do this 
we undertook analysis of population genetic structures 
of roach in the region using DNA microsatellite analysis. 
Microsatellite data were also used to calculate N e and es- 
timate levels of gene flow to determine the extent to 
which these populations are maintained through immi- 
gration of fish from less contaminated stretches of river. 

Results 

Genetic diversity and genetic bottlenecks 

A total of 1,769 roach, constituting 39 samples (roach 
sampled from one location within a river - typically a 
stretch of approximately 100 m - at one time point) from 
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32 different geographic locations in England were ge- 
notyped (Figure 1). Data for 14 microsatellite loci [see 
Additional file 1 for details] revealed high genetic diversity 
in all 39 samples (Table 1). Allelic richness (AR) ranged 
from 6.8 to 8.9, and expected heterozygosity (H e ) ranged 
from 0.69 to 0.75 (Table 1, see Additional file 2 for diver- 
sity statistics for each locus). Nevertheless, significant dif- 
ferences in AR (analysis of variance (ANOVA), F( 38)5 32) = 
2.1398, P = 0.00014) and observed heterozygosity (H Q ) 
(ANOVA, F (38 ,532) = 1.8677, P = 0.0017) among samples 
were detected. Roach sampled at two relatively unpolluted 
sites (LamSha and LeeHUS'95) exhibited comparatively 
low AR and LeeHyd'95 exhibited comparatively relatively 
low H Q [see Additional file 3]. No significant differences in 
H e were found. Additionally, there was evidence for gen- 
etic bottlenecks at two relatively unpolluted sites sampled 
within the rivers Arun (AruHUS) and Lee (LeeHUS), and 
at a polluted site in the Lee (LeeWhe) (Table 1). 

Population genetic structure of roach in English rivers 

We undertook an analysis of population genetic struc- 
ture in order to examine the genetic similarity of roach 



between and within catchments. Analysis of molecular 
variance (AMOVA) indicated the majority of variation 
was partitioned among individuals within river locations, 
with river location accounting for a small 2.27%, but 
highly significant proportion of the genetic variation 
(Table 2). Average pairwise F ST between roach samples 
from different catchments was 0.028 and comparisons 
were consistently highly significant (Table 3, Additional 
file 4). The population tree (Figure 2) shows distinct 
clusters of samples in different catchments: the Arun, 
the Nene, the Anglian Blackwater and the Trent, sup- 
ported with moderate-high bootstrap values (>64%). 
Samples from the Arun and the Nene also group in the 
principal component (PCA) and the STRUCTURE ana- 
lyses [see Additional file 5 and Additional file 6] demon- 
strating a distinct genetic identity of fish at these sites. 
Utilising the method of Evanno et al. [35], the inferred 
most likely number of genetically distinct clusters in the 
STRUCTURE analysis was three, comprising: the Arun, 
the two most upstream sample sites in the river Lee, and 
all remaining sites genotyped. However, from visual exam- 
ination of STRUCTURE plots run with higher levels of K 
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Figure 1 Locations of sample sites in England. (A) Modified from Williams et al. [33]. For (B) and (C), numbers in circles represent the number 
of obstructions to fish movement (either weirs or locks). Locks in the Kennet and Thames below ThaWhi have fish passes for salmon movements, 
but are likely to represent a barrier to movement of roach. In the upper river Lee, only weirs over 1 m are shown. PE = population equivalents, 
which relates to the size of the population served by the waste water treatment works. The different colours used to depict the rivers represent 
predicted mean estrogenicity (E2 equivalents in ng/L) [34]. 
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Table 1 Sampling locations, genetic diversity statistics (allelic richness {AR) and expected heterozygosity (H e )) for each 



population sample 


Sample code 3 


River 


Year 


Lat. 


Long. 


E 2 Eq (ng/L) b 


No. fish genotyped 


Bottleneck (TPM) C 


AR 




Ref. d 


Blackwater 






















BlaBIM 


Blackwater 


2010 


51.78529 


0.651013 


4.1 


55 


0.40 


7.9 


0.71 




BlaSti 


Blackwater 


2010 


51.88792 


0.606184 


7.1 


55 


0.60 


8.2 


0.72 




CheAbB 


Chelmer 


2010 


51.84596 


0.423716 


1.2 


50 


0.06 


7.7 


0.75 




Nene 






















NenBro'95 


Nene 


1995 


52.24627 


-0.77987 


1.2 


47 


0.69 


8.3 


0.72 


[3] 


NenBro'99 


Nene 


1999 


" 


" 


1.2 


47 


0.80 


8.3 


0.73 


[25] 


NenEct 


Nene 


2007 


52.24234 


-0.80281 


4.2 


51 


0.79 


8.6 


0.73 




Aire 






















AirDar 


Aire 


2011 


53.79336 


-1.54911 


2.7 


43 


0.73 


8.6 


0.75 




Arun 






















AruHor'95 


Arun 


1995 


51.05516 


-0.36197 


4.1 


54 


0.90 


7.8 


0.73 




AruHor'OO 


Arun 


2000 


51.0556 


-0.36124 


4.1 


34 


0.29 


7.6 


0.73 


[25] 


AruHor'08 


Arun 


2008 


" 




4.1 


69 


0.73 


7.9 


0.74 


[26] 


AruHUS 


Arun 


1995 


51.05953 


-0.35326 


0.2 


48 


0.01 


7.4 


0.73 


[3] 


Thames catchment 






















BlaEvH'10 


Blackwater 


2010 


e 




4.2 


41 


0.98 


8.5 


0.72 




BlaEvH'OO 


Blackwater 


2000 


51.327864 


-0.769635 


8.8 


47 


0.98 


8.3 


0.71 


[25] 


BouChe'1 1 


Bourne 


2011 


51.38086 


-0.47711 


4.8 


56 


0.62 


8.3 


0.74 




BouChe'02 


Bourne 


2002 


51.40286 


-0.54223 


5.8 


31 


0.55 


8.8 


0.75 




BouChe'06 


Bourne 


2006 


51.40330 


-0.54150 


5.8 


48 


0.25 


8.3 


0.75 


[26] 


GadCas 


Gade 


2010 


51.65893 


-0.42559 


NM f 


56 


0.38 


8.1 


0.74 




Ken Bui 


Kennet 


2010 


51.39707 


-1.28485 


0.6 


51 


0.82 


8.4 


0.74 




KenFou 


Kennet 


2010 


51.43564 


-0.97664 


8.1 


32 


0.64 


8.7 


0.75 




KenNor 


Kennet 


2010 


51.40165 


-1.33725 


0.2 


52 


0.62 


8.5 


0.73 




LamSha 


Lambourn 


2011 


51.40816 


-1.30843 


0.03 


41 


0.10 


6.8 


0.72 




LeeEss 


Lee 


2010 


51.77279 


-0.18818 


6.6 


56 


0.60 


8.3 


0.73 




LeeHyd 


Lee 


2010 


51.83958 


-0.35825 


10.3 


28 


0.36 


8.2 


0.73 




LeeHyd 


Lee 


1995 


51.84751 


-0.37111 


11.6 


44 


0.62 


7.7 


0.70 


[3] 


LeeHUS 


Lee 


1995 


51.84935 


-0.37395 


NM 


37 


0.001 


7.0 


0.73 


[3] 


LeeSta 


Lee 


2010 


51.7894 


-0.22496 


6.6 


31 


0.85 


8.4 


0.71 




Lee'00 


Lee 


2000 


g 




NM 


41 


0.90 


8.1 


0.70 


[25] 


LeeWhe 


Lee 


2010 


51.81424 


-0.28903 


6.6 


55 


0.05 


8.5 


0.75 




MolMea 


Mole 


2010 


51.19028 


-0.18581 


5.8 


42 


0.84 


8.4 


0.73 




RayRod 


Ray 


2003 


51.57093 


1.81815 


10.9 


30 


0.40 


7.7 


0.72 




StoBri 


Stort 


2010 


51.77989 


0.05024 


4.1 


52 


0.92 


8.1 


0.71 




StoTed 


Stort 


2010 


51.83115 


0.16892 


6.0 


30 


0.97 


8.2 


0.69 




ThaCul 


Thames 


2010 


51.65046 


-1.26739 


1.6 


44 


0.45 


8.8 


0.74 




ThaHam 


Thames 


2010 


51.55989 


-0.87347 


1.8 


44 


0.45 


8.2 


0.74 




ThaWhi 


Thames 


2010 


51.48662 


1 .08974 


1.5 


60 


0.88 


8.2 


0.74 




ThaSha 


Thame 


2010 


51.75281 


-1.03319 


1.9 


50 


0.38 


7.9 


0.74 




WanMoh 


Wandle 


2011 


51.40329 


0.18821 


3.3 


48 


0.33 


8.2 


0.75 
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Table 1 Sampling locations, genetic diversity statistics (allelic richness [AR) and expected heterozygosity (H e )) for each 
population sample (Continued) 

Trent 

TreWol Trent 1995 52.781178 -1.971789 3.7 45 0.55 8.0 0.72 

TreNot Trent 2007 g 24 0.50 8.9 0.75 

Full diversity statistics for each locus are given in Additional file 8 and statistical differences in genetic diversity are given in Additional file 3. a Headings in bold 
indicate different catchments; ""Estradiol equivalents, the predicted average estrogenicity at the sample site, calculated from the predicted concentrations of 
Ei, E 2 and EE 2; C TPM = two phase model for microsatellite evolution used for this test; d Source of samples; e A composite sample of fish caught at BlaEve 
(51.354119, -0.8584853) and BlaHaw (51.324119, -0.7665606). Effluent concentrations are an average between the two sites for statistical analysis; 
f NM = not modelled. For this site there are no major upstream discharges. For statistical analysis, and average E 2 Eq value for sites with no major discharges was 
used. 9 The exact locations for these sample sites are unknown. Numbers in bold indicate significance (p<0.05). 



[see Additional file 6] other possible groups are apparent 
We found no evidence that roach in the Thames catch- 
ment constitute a distinct genetic group, as samples failed 
to group together in any analysis (Figure 2, Additional file 
5 and Additional file 6). This may reflect a true lack of 
genetic distinctiveness of roach in this catchment, but 
may also result from the limited ability of the microsatel- 
lite markers used to resolve population genetic structure 
at this level 

Population genetic structure of roach within rivers and 
catchments 

Despite Thames catchment roach appearing not to 
constitute a single genetic unit, distinct from roach in 
other regions, the study did find evidence for signifi- 
cant genetic structuring in roach populations within 
the Thames catchment. This suggests the existence of 
local subpopulations exchanging a limited number of ef- 
fective migrants (breeding individuals) rather than pan- 
mixia (where all individuals are potential partners). For 
example, average F ST between samples in the Thames 
catchment was 0.022, only slightly lower than the average 
for between-catchment comparisons (0.028) for the study 
as a whole, while 262 of the 325 pairwise F S t comparisons 
in the catchment were highly significant. There was a 



weak, but significant, relationship between genetic and 
geographic distance (r 2 = 0.1089, P = 0.010) within the 
catchment, indicating a tendency for individuals to pro- 
duce offspring with fish from nearby populations rather 
than distant populations [see Additional file 7]. Addition- 
ally, the population tree (Figure 2) and PCA analyses [see 
Additional file 5] showed groups comprising samples from 
neighbouring Thames sites: three from the main Thames; 
four from the Kennet and its tributary (Lambourn); sam- 
ples from the Stort and the Lee; and samples from the 
Wandle and Mole. Samples from the Thames Blackwater 
collected in the years 2000 and 2010 (approximately two 
to three generations) clustered with very high bootstrap 
support (98%) in the population tree (Figure 2). This indi- 
cates that this roach population is largely restricted to 
this stretch of river, which includes both moderately and 
highly polluted sites, and has no substantial uncontamin- 
ated upstream stretch (Figure 1). 

Despite the proximity of some populations in the PCA 
and tree, we also found significant genetic differentiation 
between samples from some neighbouring stretches of 
the same river, sometimes occurring over small distances 
of separation (<10 km), for example, within the upper Lee 
(see below), between the Lee and the Stort, between the 
Blackwater and main Thames, between the Lambourn 



Table 2 Analysis of molecular variance (AMOVA) testing for partitioning of genetic variation among roach samples, 
grouped according to geography 



Source of variation 


d.f. 


Sum of squares 


Variance 


% Total 


P-value 


Geographical partition, location 3 












Among locations 


32 


584.063 


0.11631 


2.27 


<0.00001 


Among samples within locations 


6 


35.197 


0.00953 


0.19 


0.03226 


Within samples 


3,499 


1 7,489.296 


4.99837 


97.54 


<0.00001 


Total 


3,537 


18,108.556 


5.12421 






Geographical partition, catchment 15 












Among groups 


5 


189.529 


0.06309 


1.22 


<0.00001 


Among samples within groups 


33 


429.731 


0.08909 


1.73 


<0.00001 


Within samples 


3,499 


1 7,489.296 


4.99837 


97.05 


<0.00001 


Total 


3,537 


18,108.556 


5.15055 







a Samples from same location caught in different years are grouped; b Samples grouped by catchment (Aire, Arun, 'Blackwater and Chelmer', Nene, Thames 
and Trent). 
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Table 3 Summary of pairwise F ST and D est among roach samples (see Additional file 4 for full table of values) 

F ST D est P-value a 



Between catchments 

Thames/A run 
Thames/Nene 
Thames^rent 

Thames/Chelmer, Blackwater 
Within catchments 

Thames 

Arun 

Nene 

Blackwater/Chelmer 

Neighbouring samples in the river Lee/Stort 

LeeHUS/LeeHyd'95 

LeeHyd'10/LeeWhe 

LeeWhe/LeeSta 

LeeSta/LeeEss 

LeeEss/StoBri 

Other neighbouring stretches 

BlaBIM/BlaSti 
AruHUS/AruHor'95 



0.033, (0.002-0.074) b 
0.026, (0.007-0.076) 
0.026, (0.008-0.069) 
0.022, (0.005-0.057) 

0.022, (-0.004-0.090) 
0.007, (0.002-0.014) 
0.002, (0.000-0.003) 
0.022, (0.005-0.057) 

-0.002 
0.020 
0.009 
0.009 
0.015 

0.007 
0.002 



0.0617, (0.01 24-0.1 61 2) b 
0.0453, (0.0126-0.1458) 
0.0447, (0.0059-0.1394) 
0.0444, (0.0035-0.1191) 

0.0376, (-0.0058-0.1914) 
0.0064, (0.0015-0.0104) 
0.0010, (-0.0001-0.0020) 
0.0454, (0.0058-0.0652) 

0.0017 
0.0352 
0.0189 
0.0127 
0.0349 

0.0058 
0.0043 



0.78 
>0.00001 
>0.00001 
>0.00001 
>0.00001 

>0.00001 
0.22 



a For F ST estimate; average, (range). Pairwise values for neighbouring stretches in the Lee/Stort, the Anglian Blackwater and the Arun are also given. 



and the Kennet and within the Anglian Blackwater. In 
other cases, despite the separation of sampling locations 
by in-river impoundments such as weirs, we found no sig- 
nificant genetic differentiation between sites, for example, 
within the Stort, the main Thames, the Kennet, the Arun, 
the Nene and the Trent. Thus, patterns of within-river 
genetic structure differed between river stretches. For 
some other fish species analysis of genome-wide SNP data 
has provided greater resolution in population structure 
than that achievable using microsatellite data [37], and it 
is possible that some fine-scale genetic structure in the 
roach populations has not been detected with the micro- 
satellites used in the current study. 

Relationship between exposure to estrogenic effluents 
and effective population size 

Estimates of N e calculated from the microsatellite data 
using the approximate Bayesian computation method 
(Afe(ABC))> ranged from 54 to 301 for each sample, with 
higher precision for small N e estimates (Figure 3A). We 
found no evidence for a correlation between N e ( ABC ) and 
predicted E 2 equivalents (E 2 Eq), a measure of total estro- 
genicity of the river water due to contamination by sewage 
effluent (generalized linear models (GLM), F (1>2 o) = 0.7468, 
P = 0.40) or for an interaction between sample site and 
estrogen exposure (GLM, F (6)19 ) = 1.9954, P = 0.14) across 
the 28 sample sites where no recent restocking had oc- 
curred and had sufficient sample sizes for robust N e 



calculation. However, the 95% confidence intervals (CI) 
for the model coefficient indicated N e ( ABC ) could decrease 
by a maximum of 5.6% for each incremental increase in 
exposure of 1 ng/L E 2 Eq, or 65% at 11.6 ng/L E 2 Eq, 
equivalent to the most polluted river stretch included in 
this study. The inclusion of roach density as an additional 
covariate within the model also produced a non- 
significant result (GLM, F (1 , 16) = 1.3966, P = 0.26), albeit 
for a reduced number of sites (19). Similarly, there was 
no significant correlation between the other variables 
included in the statistical analyses (average flow rate, 
geographic/phylogenetic group and roach density) with 
A/e(ABC)- This analysis makes the assumption that immi- 
gration of fish from remote sites is limited and this is dis- 
cussed below. 

There was limited evidence for reduced variation in N e 
in roach sampled from more contaminated stretches of 
river compared to those sampled from less contaminated 
sites; all estimates above 6 ng/L E 2 Eq were below 100, 
whereas there was greater variation (54 to 301) where 
E2Eq was below 6 ng/L (Figure 3B). Estimates of N e 
using other methods were of the same order of magni- 
tude but had wider confidence intervals for each esti- 
mate; A/g(sA) ranged from 36 to 145 and also showed no 
relationship with E 2 Eq (Figure 3C). Temporal estimates 
of N et calculated from allele frequency changes over sev- 
eral generations using the Jorde and Ryman method 
[40], varied from 14 to 265, but were available for too 
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Figure 2 Neighbour-joining phylogenetic tree for the 39 roach population samples. The tree is based on the data from 14 microsatellite 

loci using chord distance from Cavalli-Sforza and Edwards [36]. Only bootstrap values above 50% are shown. Numbers at the end of sample codes 

indicate years that populations were sampled, 
k J 



few locations to make meaningful comparisons (Table 4). 
Overall, the relatively small variations in N e observed in 
this study could not be explained by any of the environ- 
mental or other variables measured in this study. 

Population genetic structure within the River Lee, a high 
effluent river 

The average proportion of effluent in the upper Lee 
downstream from Harpenden and East Hyde WWTWs 
ranges from 28% to 70% in different stretches. Signifi- 
cant genetic differentiation was detected between fish 
sampled from four of the five locations in this stretch of 
river (F ST values >0.009), and between these samples and 
two from its tributary, the Stort (F ST >0.015) shown in 
Table 3. The presence of numerous large weirs (Figure 1) 
likely confines fish to particular areas of this river; none- 
theless, samples from the Lee and the Stort did cluster to- 
gether in some analyses (Figure 2, Additional file 5). A 
sample from the upstream, unpolluted sample site (Lee- 
HUS) grouped with two samples (collected in 1995 and 
2010) from the most polluted river stretch immediately 
downstream (LeeHyd). The next sample site downstream, 
LeeWhe, was distinct from these (Figure 2, Additional 



file 5 and Additional file 6), indicating restricted move- 
ment of fish between LeeHyd and LeeWhe over at least 
three to five generations (Figure 2). Analysis using the 
program IM A 2 [42] suggested that there was less than one 
effective (breeding) migrant per generation between 
LeeHyd and LeeWhe in either direction, and about one 
migrant per generation from LeeWhe downstream to 
LeeSta (Figure 4). Collectively, these data suggest that 
roach populations at LeeWhe and those downstream do 
not rely on migration from the uncontaminated stretch of 
this river. Despite this, N e ( ABC ) estimates for these polluted 
sites in the upper Lee ranged from 70 to 84 (95% CI: 
50 to 127) compared to only 54 (95% CI: 42 to 82) for the 
upstream uncontaminated location, suggesting no sub- 
stantial impact of the effluent on the effective population 
size of these roach. 

Discussion 

In this study, analyses of population genetic structure 
via the analysis of DNA microsatellite loci identified dis- 
tinct subpopulations of roach in two tributaries of the 
Thames, the rivers Lee and the Blackwater, that were 
largely restricted to high-effluent stretches of the rivers 
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Figure 3 Effective population size (A/ e ) plotted against 
predicted estrogen exposure for 37 population samples of 
Rutilus rutilus. (A) N e calculated using the Approximate Bayesian 
Computation (ABC) method in the program OneSAMP [38]. Tests for 
homogeneity of variances: Bartlett's, P = 0.0036, Levene's, P = 0.1 7. 
(B) Results of binning analysis for data shown in A; each bin which 
encompasses all data points starting at the lower point represented 
by the mean and standard error up to, but not including, the next 
bin. (C) N e calculated using the sibling assignment (SA) method in 
Colony [39]. In A and C, error bars are 95% confidence intervals. In 
cases in which more than one population had similar values, data 
points overlie each other; thus, individual data points are not always 
visible. These plots include estimates from sample sites sampled in 
different years, for example, in the River Nene (N) (which were 
averaged for statistical analysis) and sites where recent restocking 
had occurred (open circles: River Aire (A), River Wandle (W)), which 
were excluded from the statistical analyses. 
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over multiple generations. This is despite evidence for 
widespread feminization of male fish in the studied riv- 
ers and previous evidence that feminization alters breed- 
ing capabilities [21,26]. Both of these tributaries contain 
feminized fish [25]), with predicted average exposure of 
between 4 and 9 ng/L E 2 Eq. We also found no statistically 



Table 4 Temporal estimates of effective population size 
(A/ e ) among roach samples 



Site 


Time interval 


G a 


A/ e (95% Cl) b 


N e (95% Cl) c 


Nene 


1995 to 1999 


1 


265 (66-00) 


619 (82-oo) 


AruHor 


1995 to 2000 


1 


14(8-79) 


48 (26-137) 




2000 to 2008 


2 


60 (31-1733) 


321 (106-°°) 




1995 to 2008 


3 


73 (43-247) 


232 (123-669) 


BouChe 


2002 to 2006 


1 


32 (17-219) 


51 (25-151) 




2006 to 201 1 


1 


45 (23-9572) 


63 (35-162) 




2002 to 201 1 


2 


87 (44-3897) 


495 (295-1405) 


LeeHyd 


1995 to 2010 


3 


137 (81 -00) 


346 (145-35518) 


BlaEvH 


2000 to 2010 


2 


141 (68-00) 


206 (83-°°) 



a G = assumed number of generations between sampling points; Calculated 
using the Jorde and Ryman method [40]; Calculated using the classical 
moment-based method of Waples [41]. CI, confidence interval. 

robust evidence for a substantial impact of estrogenic sew- 
age effluents on N e of roach. The possibility of a reduction 
in N e of up to 65% for roach living in the most polluted 
river stretches (E 2 Eq of 11.6 ng/L) could not be ruled out, 
due to the wide 95% confidence intervals associated with 
the statistical model. Moreover, our analysis included rela- 
tively few samples from rivers in the highest risk category, 
largely because these sites are rare. 

Caveats 

As with any modelling exercise, this analysis makes 
assumptions that may affect the interpretation of the 
results. One of these assumptions is that migration be- 
tween sites with different pollution profiles is limited over 
two to three generations, the time frame likely to have the 
greatest influence on A^abc) [43]. This was ensured by 
selecting sites with physical obstructions between them. 
However, quantifying migration rates over this timescale 
was not always possible because all potential source popu- 
lations could not be sampled and, in some cases, we found 
no significant genetic differentiation between roach at 
sites distant from one another. Genetic differentiation can 
take many generations to manifest with low levels of mi- 
gration [44]. Histological data from the Arun and the Lee 
show that feminized gonads in roach were approximately 
6-fold (Lee) and approximately 2.5-fold (Arun) more 
prevalent in populations living in the stretches down- 
stream of major WWTW inputs compared with those 
living upstream [3]. This demonstrates that migration 
in these rivers was indeed restricted to stretches delimited 
by physical barriers, despite no significant genetic dif- 
ferentiation observed between river stretches (F ST <0.002, 
Table 3). 

A second assumption is that no restocking of the riv- 
ers sampled had occurred or that the effect of restocking 
activities on N e ( ABC ) was relatively minor. Approximately 
500,000 hatchery-reared roach just over one-year-old 
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Figure 4 Posterior probabilities of migration rates, effective population sizes (A/ e ) and time since divergence estimated using IM A 2 [42] 
for two pairs of roach populations in the River Lee. (A) Estimates for parameters calculated for the LeeHyd'10 and LeeWhe and (B) for LeeSta 
and LeeWhe. These N e values are influenced by average N e since the initial split of the populations. 



(so called '1+' fish) have been introduced into the Thames 
catchment since 2000; broodstock for hatchery fish origin- 
ate from the river Trent. The influence of restocking activ- 
ities on A/e( A Bc> however, is likely to be relatively minor, as 
the sites sampled in this study were separated by major 
physical barriers from sites where these introductions had 
occurred. Moreover, introductions prior to 2000 are un- 
likely to have had a large influence on N e ( ABC ) as this is 
primarily affected by the size and variance in reproductive 
success of the parental generation, which would have 
spawned between 2004 and 2007 for most of the samples 
in this study. However, we cannot exclude some influence 
of introductions prior to 2000 as some of the summary 
statistics used to calculate Ne(ABC) are known to be af- 
fected by demographic processes over a longer time 
period [38] . The effects of introductions on genetic diver- 
sity, the detection of bottlenecks and population structure 
are likely to be greater, as these factors are affected by 
demography over many generations. However, neither the 
success of the reintroduced fish nor the size of the roach 
population in the Thames is currently known. In salmo- 
nids, restocking success is highly variable and has been at- 
tributed to local adaptation [45]. Using our microsatellite 
dataset, 73% of 48 individual roach from a stretch of the 
Wandle (restocked in 2007, 2009, 2010) assigned to the 
Thames reporting regions. Only 5% (two fish) assigned to 
the Trent (the source of the parents of introduced fish), 
which may be mis-assignments, as 5% also assigned to the 



Arun, 10% to the Anglian Blackwater and 10% to the 
Nene, from where no restocking had taken place. Thus, 
the success of the re-introduced fish may be low, but this 
requires further investigation. 

Evidence for self-sustaining populations in effluent 
contaminated rivers 

While this study does not exclude the possibility that es- 
trogenic effluents reduce N e of fish populations, it suggests 
that roach populations can be self-sustaining despite ex- 
posure to estrogens over several generations. These find- 
ings are consistent with the fact that the prevalence of 
male fish with moderate to severely feminized gonads 
(that have been shown to have substantially reduced re- 
productive competitiveness in controlled breeding studies) 
is generally less than 10% in English rivers [3,25,26,46]. 
The reproductive competitiveness of fish with the more 
common mild-intersex condition is similar to those of fish 
without gonadal feminization [26] . In roach, the gonads of 
male fish exposed to estrogens become progressively femi- 
nized with age [46], so gonadal feminization could theoret- 
ically increase N e by reducing the reproductive dominance 
of large older males in estrogen-contaminated rivers. 
While the effects on females are less well studied, female 
roach exposed to an undiluted effluent for three years in 
large tanks were able to breed, despite the fact that this ex- 
posure caused complete gonadal feminization of males 
[21]; similarly, the majority of females collected from two 
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effluent-polluted rivers examined in this study were also 
able to breed [26]. 

Population risks of long-term exposures to estrogenic 
effluents 

The results of this study on wild roach populations seem 
to contrast with studies that have assessed population 
risk through long-term exposures to estrogens, where 
exposure to concentrations between 3 and 6 ng EE 2 /L 
[17-20] or to a full-strength effluent [21] resulted in all- 
female populations and/or reproductive failure. The ap- 
parent difference between the wild populations and 
those experiments designed to simulate real world' ex- 
posure, however, may be because the fish living in the 
effluent-contaminated rivers examined in this study have 
been exposed to a lower level of estrogen or because all 
of the estrogen is not bioavailable; organic pollutants can 
bind to particulates and dissolved organic matter [47]. 
The most contaminated river in this study has a mean 
proportion of effluent of approximately 70%, although the 
majority of contaminated English rivers average approxi- 
mately 10% to 30% [34]. While EE 2 has been measured up 
to approximately 4 to 8 ng/L in English WWTW effluents 
[33,48], for the most part, they are lower [23,49], and es- 
trogen concentrations vary greatly over short periods. For 
instance, EE 2 was detected in only 21 of 135 water sam- 
ples from the Lee, although occasionally reaching 4 ng/L 
[33]. Considering the totality of estrogen content, the pre- 
dicted average estrogenicity of the most contaminated site 
in this study is 12 ng/L E 2 Eq and would be below 21 ng/L 
for 90% of the time. Only 1% to 3% of 10,313 individual 
river reaches in the UK receiving WWTW effluent were 
predicted to have average E 2 Eq >10 ng/L, and, of these, 
many are ditches composed almost entirely of sewage ef- 
fluent [34]. As E 2 is approximately 10 times less potent 
than EE 2 in inducing gonadal feminization in fish [34,50], 
it is probable that average life-time exposure to estro- 
gens in the wild does not currently reach the concen- 
trations shown to cause sex-reversal and population collapse 
in controlled experimental exposures. Green et al. [51] 
recently predicted a doubling of estrogen exposure con- 
centrations in some rivers with population growth and cli- 
mate change by 2050 suggesting an increased likelihood of 
population level effects of estrogenic effluents in the fu- 
ture, unless mitigated by substantial improvements in 
sewage treatment processes. 

Influences on the population genetic structure of roach 

The population genetic structure of roach in southern 
England observed in this study may have been influenced by 
historical biogeography, migratory behaviour, human trans- 
locations and in-river barriers. Roach can be highly mobile 
and can migrate over 10 km, particularly in the spawning 
period April to June, if migration is not obstructed [52]; 



additionally, there is some evidence that roach show fidel- 
ity in migration and return to spawning sites they have 
used previously [53]. Within the Thames catchment, the 
observed population genetic structure likely results, at 
least in part, from the large number of physical barriers, 
such as weirs and locks (Figure 1); these have been recog- 
nized as major factors restricting movement (including 
downstream) of roach [54]. Similarly, the importance of 
barriers in driving intra-catchment genetic variation is 
well documented in other fish species, for example, brown 
trout [55]. Only obstructions in the main River Thames 
and the Kennet are equipped with fish passes and, al- 
though some passes can be used by roach [56], the effect- 
iveness of these passes in allowing fish movement has not 
been studied. As we identified significant genetic differen- 
tiation between roach from the Kennet and the Thames, 
despite being connected by fish passes, these passes may 
represent major physical separation barriers to this fish 
species. 

Conclusions 

Despite the widespread feminization of male roach in 
effluent-contaminated rivers of southern England, using 
nuclear DNA microsatellites we were able to identify some 
populations that have been confined to stretches of river 
with moderate to high exposure to estrogenic effluents 
over multiple generations. We also found no evidence of a 
correlation between the N e of roach populations and pre- 
dicted exposure to estrogens, although because of the 
wide confidence intervals, a reduction in N e of up to 65% 
is still possible at the most contaminated sites. 

Methods 

Study location 

Southern England, particularly the region within the 
Thames catchment, was chosen for this study for four rea- 
sons. Firstly, it is a densely populated region with relatively 
low rainfall and, therefore, includes some river stretches 
with some of the highest concentrations of WWTW efflu- 
ents in the United Kingdom [34]. Secondly, feminization 
of roach has been widely reported in the region [3,46]. 
Thirdly, many rivers in the region have locks, dams or 
weirs which are likely to limit movement of fish species 
between stretches of river with different pollution profiles. 
Fourthly, the effluent concentrations and risk of estrogenic 
endocrine disruption have been modelled [34], Sample 
sites are shown in Figure 1 and Table 1 and were selected 
to span the full range of predicted estrogen concentrations 
in English rivers and where obstructions are likely to re- 
strict fish movements [3,46]. 

Roach study species 

Roach was selected as the study species because it is native 
and widely distributed in the United Kingdom, including 
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in rivers polluted with WWTW effluents. Additionally, 
widespread feminization has been reported in wild popula- 
tions and with a proven association with exposure to estro- 
genic effluents [14,19,21,57]. Roach generally reach sexual 
maturity between two and three years and spawn annually 
in the spring. Adult roach can migrate considerable dis- 
tances, but where weirs obstruct upstream and down- 
stream movement they are able to complete their lifecycles 
in a single stretch of river [54]. 

Population-genetic analyses 

To understand the extent to which roach populations 
are restricted to various stretches of river, several ap- 
proaches were used to investigate population genetic struc- 
ture. We analysed microsatellite loci variation in 1,769 fish 
sampled between 1995 and 2011. Each fish was genotyped 
at between 14 to 19 microsatellite loci. Microsatellite geno- 
types are provided in Additional file 8. Protocols for DNA 
extraction and details of amplification of the microsatellite 
loci are illustrated in Additional file 1. Data for 14 
microsatellite loci were used to calculate three mea- 
sures of genetic diversity: observed heterozygosity (H Q ) 
and expected heterozygosity (H e ) using GenAlEx 6 
[58]; allelic richness (AR) was calculated using Fstat 
v2.9.3 [59] - see Additional file 1 for full details. The 
programme BOTTLENECK [60,61] was used to test for 
recent genetic bottlenecks. This programme tests for a 
relative excess in heterozygosity that is apparent for a few 
generations after a bottleneck and develops because allelic 
diversity declines faster than heterozygosity, due to loss of 
rare alleles. Pairwise genetic differentiation between the 
sampled sites was estimated using F ST) calculated using 
Arlequin 3.5 [62] and Josts D, D est [63], calculated using 
SMOGD [64]. The significance of the F ST estimates was 
assessed based on 10,000 permutations. AMOVA was per- 
formed using Arlequin. In order to test whether fish are 
more likely to produce offspring with local mates, com- 
pared to mates in geographically distant locations within 
the Thames catchment, isolation by distance analysis was 
performed using the Mantel test [65] in GenAlEx 6 [58]. 
Genetic similarity between populations was investigated 
using population based trees, calculated in POPULA- 
TIONS, vl.2.30beta [66], PCA in GenAlEx 6 [58] and a 
Bayesian clustering approach in STRUCTURE [67]. Fi- 
nally, the program IM A 2 [42,68] was used to estimate mi- 
gration rates between adjacent populations within high 
effluent stretches of the Lee, giving relatively high pairwise 
F ST values (LeeHyd, LeeWhe, and LeeSta). See Additional 
file 1 for further details. To investigate the influence of 
restocking, genetic assignment of fish from the Wandle 
was undertaken using the 'leave one out test' in the com- 
puter program, ONCOR [69], based on their micro- 
satellite genotypes. The reporting regions comprised: 
(1) the Wandle, (2) Lee/Stort, (3) rest of the Thames, 



(4) Trent, (5) Nene, (6) Arun, (7) Chelmer and (8) Anglian 
Blackwater. All animals used in this research were treated 
humanely and with regard for the alleviation of suffering; 
all procedures were subject to approval by the local 
ethical review process as required under the U.K. Animals 
(Scientific Procedures) Act (1986). 

Effective population size 

To test whether WWTW effluents substantially reduce 
the size of breeding populations, effective population 
sizes (A/g), which relate to the number of breeding fish 
and skews in breeding success, were estimated using the 
microsatellite genotypes. We compared N e from sites 
ranging from little/no upstream WWTW effluent inputs 
to those where the majority of the flow can comprise 
WWTW effluent. Two single sample (generation) methods, 
that use different aspects of the microsatellite data, were 
used to estimate N e for each population; the Approximate 
Bayesian Computation (ABC) method using ONeSAMP 
1.2 [38], hereafter referred to as N e ( ABC y, and the sibling 
assignment method (SA), N e ($A) [39]. Temporal estimates 
for Ne, which are calculated from the change in allele fre- 
quencies between generations, were also estimated for 
sites where fish had been sampled more than once using 
TempoFs [40] and NeEstimator [70]. For further details 
see Additional file 1. 

Statistical analysis 

GLM were used to examine the relationship between 
predicted exposure to estrogenic effluents and N e ( AB c)« 
For further details see Additional file 1. Differences in 
genetic diversity among sampled populations were tested 
using ANOVA. All statistical analysis was performed 
using the software R 2.13.0 [71]. 
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Additional file 1: Details of microsatellite genotyping methods, 
population-genetic analysis and statistical analysis. 

Additional file 2: Genetic diversity estimates for each locus in each 
population. 

Additional file 3: Graph showing statistical differences in genetic 
diversity among roach Rutilus rutilus populations. 

Additional file 4: Pairwise F ST and D est among roach population 
samples. 

Additional file 5: Multidimensional scaling plots of pairwise D A 
distances [72], 

Additional file 6: Structure analyses plots using the locprior model 
and analysis of optimum number of genetic units in Structure 
Harvester [73]. 

Additional file 7: Correlation between genetic distance and 
geographic distance (km) between pairs of sites for 24 population 
samples from the Thames catchment. 

Additional file 8: Microsatellite genotypes for 1,769 roach 
genotyped in this study. 
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AMOVA: analysis of molecular variance; ANOVA: analysis of variance; 
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17 alpha-ethinylestradiol; GLM: Generalized Linear Models; H e : expected 
heterozygosity; HWE: Hardy-Weinberg equilibrium; MCMC: Markov chain 
Monte Carlo; MEGA: Molecular Evolutionary Genetic Analysis; N b : effective 
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size estimate calculated using the sibling assignment method; 
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